Cognitive Load

Back of Mind · 2026-04-16 2026-04-17-w3

The Most Important Number

Dan Davies asks how many words of AI output a manager can actually verify per day before judgment silently degrades, and the honest answer is that almost no organization has tried to find out. The self-driving car literature documented this vigilance decrement precisely for monitoring tasks; the same cognitive dynamic applies to anyone reviewing model outputs at volume, and unlike physical fatigue it is invisible to the person experiencing it. The Anthropic alignment paper this week hit the same wall at the research level: automated generation scaled, evaluation did not, and the production failure on Sonnet 4 is the visible edge of that gap. The WSJ piece shows what the gap looks like at the infrastructure level: reliability became the competitive moat the moment generation capacity exceeded the enterprise's ability to trust it. Organizations are measuring tokens per second and cost per query; the number that will actually constrain their AI leverage is one nobody is tracking.

Back of Mind 2026-04-16-3

The Most Important Number

Dan Davies identifies the number nobody wants to find: how many words of AI output can a manager verify per day before judgment silently degrades? The self-driving car literature already answered this for monitoring tasks; the same vigilance decrement applies to AI output review. Organizations will systematically overestimate their people's verification capacity, and unlike physical exhaustion, cognitive degradation is invisible to the person experiencing it. The binding constraint on AI leverage isn't generation capability; it's human verification throughput, and we're structurally incentivized never to measure it.